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Abstract 

We describe QUAERO, a method that i) enables the automatic optimization of searches for physics 
beyond the standard model, and ii) provides a mechanism for making high energy collider data gen- 
erally available. We apply QUAEro to searches for standard model WW, ZZ, and tt production, and 
to searches for these objects produced through a new heavy resonance. Through this interface, we 
make three data sets collected by the D0 experiment at v'* = 1-8 TeV publicly available. 
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It is generally recognized that the standard model, a 
successful description of the fundamental particles and 
their interactions, must be incomplete. Models that ex- 
tend the standard model often predict rich phenomenol- 
ogy at the scale of a few hundred GeV, an energy regime 
accessible to the Fermilab Tevatron. Due in part to the 
complexity of the apparatus required to test models at 
such large energies, experimental responses to these ideas 
have not kept pace. Any technique that reduces the time 
required to test a particular candidate theory would al- 
low more such theories to be tested, reducing the possi- 
bility that the data contain overlooked evidence for new 
physics. 

Once data are collected and the backgrounds have been 
understood, the testing of any specific model in principle 
follows a well-defined procedure. In practice, this process 
has been far from automatic. Even when the basic selec- 
tion criteria and background estimates are taken from 
a previous analysis, the reintcrprctation of the data in 
the context of a new model often requires a substantial 
length of time. 

Ideally, the data should be "published" in such a way 
that others in the community can easily use those data to 
test a variety of models. The publishing of experimental 
distributions in journals allows this to occur at some lev- 
el, but an effective publishing of a multidimensional data 
set has, to our knowledge, not yet been accomplished by 
a large particle physics experiment. The problem ap- 
pears to be that such data are context-specific, requiring 
detailed knowledge of the complexities of the apparatus. 
This knowledge must somehow be incorporated either in- 
to the data or into whatever tool the non-expert would 
use to analyze those data. 

Many data samples and backgrounds have been defined 
in the context of sleuth , a quasi-model- independent 
search strategy for new high pr physics that has been ap- 
plied to a number of exclusive final states [^,^ in the data 
collected by the D0 detector Q during 1992-1996 in Run 
I of the Fermilab Tevatron. In this Letter we describe a 
tool (quaero) that automatically optimizes an analysis 
for a particular signature, using these samples and stan- 
dard model backgrounds, sleuth and QUAERO are com- 
plementary approaches to searches for new phenomena, 
enabling analyses that are both general (sleuth) and fo- 
cused (quaero). We demonstrate the use of QUAERO in 
eleven separate searches: standard model WW and ZZ 
production; standard model tt production with leptonic 
and semileptonic decays; resonant WW, ZZ, WZ, and tt 
production; associated Higgs boson production; and pair 
production of first generation scalar leptoquarks. The 
data described here are accessible through quaero on 
the World Wide Web [|, for general use by the particle 
physics community. 

The signals predicted by most theories of physics be- 
yond the standard model involve an increased number of 
predicted events in some region of an appropriate vari- 
able space. In this case the optimization of the analysis 
can be understood as the selection of the region in this 



variable space that minimizes cr^^'^", the expected 95% 
confidence level (CL) upper limit on the cross section of 
the signal in question, assuming the data contain no sig- 
nal. The optimization algorithm consists of a few simple 
steps: 

(i) Kernel density estimation Q is used to estimate 
the probability distributions p{x\s) and p{x\b) for 
the signal and background samples in a low- 
dimensional variable space V, where x G V. The 
signal sample is contained in a Monte Carlo file pro- 
vided as input to QUAERO. The background sample 
is constructed from all known standard model and 
instrumental sources. 

(ii) A discriminant function D(x) is defined by § 

D{x) = T^TTT- (1) 

^ ' p{x\s) + p{x\b) ^ ' 

The semi-positive-definiteness oi p{x\s) and p{x\b) 
restricts D{x) to the interval [0, 1] for all x. 

(iii) The sensitivity 5 of a particular threshold Dent on 
the discriminant function is defined as the recipro- 
cal of a'^^"''" . Dent is chosen to maximize S. 

(iv) The region of variable space having D{x) > Dent is 
used to determine the actual 95% CL cross section 
upper limit a^^"''" §. 

When provided with a signal model and a choice of vari- 
ables V, QUAERO uses this algorithm and D0 Run I data 
to compute an upper limit on the cross section of the sig- 
nal. Instructions for use are available from the QUAERO 
web site. 

Table | shows the data available within QUAERO, and 
Table |l| summarizes the backgrounds. These data and 
their backgrounds are described in more detail in Ref. [|. 
The final states are inclusive, with many events contain- 
ing one or more additional jets. Kolmogorov-Smirnov 
tests have been used to demonstrate agreement between 
data and the expected backgrounds in many distribu- 
tions. The fraction of events with true final state objects 
satisfying the cuts shown that satisfy these cuts after 
reconstruction is given as an "identification" efficiency 
(eid). Because electrons are more accurately measured 
and more efficiently identified than muons in the D0 
detector, the corresponding muon channels nfx 2j and 
fifj, 2j have been excluded from these data. 

To check standard model results, we remove WW and 
ZZ production from the background estimate and search 
(i) for standard model WW production in the space de- 
fined by the transverse momentum of the electron {p^) 
and missing transverse energy (^t) in the final state 
e^lpT, and (ii) for standard model Z Z production in the 
space defined by the invariant mass of the two electrons 
{rriee) and two jets {mjj) in the final state ee2j. Re- 
moving tt production from the background estimate, we 
search for this process (iii) in the final state e^r 4j using 
the two variables laboratory aplanarity (A) and X^Pt' 
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Final State 


Selection criteria 




jLdt 




Prj^ > 15 GeV 
I^hM< 1.7 


0.30 


108 ± pb 


e$T 2j 


Pt'^ ' > 20 GeV 

> 30 GeV 
p"/^ > 40 GeV 


0.61 


115±6 pb"^ 


ee2j 


p^^'^'^^'^ > 20 GeV 


0.70 


123 ± 7 pb-i 



TABLE I. A summary of the data available within 
QUAERO, including the selection cuts applied and the efficien- 
cy of identification requirements. The final states are inclu- 
sive, with many events containing one or more additional jets. 
Reconstructed jets satisfy > 15 GeV and |77^gtl< 2.5, and 
reconstructed electrons satisfy > 15 GeV and (|7?Sotl< 1-1 
or 1.5 <|»7dctl< 2.5), where rjdct is the pseudorapidity mea- 
sured from the center of the detector. 





Standard model backgrounds 




Final State 


muhijets W Z VV 


tt 


e/i 


data data isajet pythia 


HERWIG 


e$T 2j 


data VECBOS - PYTHIA 


HERWIG 


ee2j 


data - PYTHIA PYTHIA 



TABLE 11. Standard model backgrounds (often produced 
with accompanying jets) to the final states considered. VV 
denotes WW, WZ, and ZZ; "data" indicates backgrounds 
from jets misidentified as electrons estimated using data. 
Monte Carlo programs (isajet ^ , pythia ^ , herwig Q , 
and VECBOS |l^) are used to estimate several sources of back- 
ground. 



and (iv) in the final state efi^T 2j, using the two variables 
pfp and XIPt' assuming a top quark mass of 175 GeV. 

Including all standard model processes in the back- 
ground estimate, we look for evidence of new heavy res- 
onances. We search (v) for resonant WW production 
in the final state eflr^j, using the single variable meujj 
after constraining rriey and nijj to Mw, and (vi) for res- 
onant ZZ production in the final state ee2j, using the 
variable rueejj after constraining rrijj to Mz- In both 
cases we remove events that cannot be so constrained. 
To obtain a specific signal prediction, we assume that 
the resonance behaves like a standard model Higgs bo- 
son in its couplings to the W and Z bosons. Constraining 
nieu to Mw and rujj to Mz, we use the quality of the 
fit and meujj to search (vii) for a massive W' boson in 
the extended gauge model of Ref. . Using mei> 4j after 
constraining rrie^ to M\y, we search (viii) for a massive 
narrow Z' resonance with Z-like couplings decaying to 
ti~^ W+bW~b-^ ei^Aj. 

Non-resonant new phenomena are also considered. The 
variables nijj and either mj^^ or m^e are used to search 
for a light Higgs boson produced (ix) in association with 
a W boson, and (x) in association with a Z boson. Final- 
ly, we search (xi) for first generation scalar leptoquarks 
with mass 225 GeV in the final state ee 2j using rriee 
and St, the summed scalar transverse momentum of all 



electrons and jets in the event. The numerical results of 
these searches are listed in Table III. Figures and ^ 
present plots of the signal density, background density, 
and selected region in the variables considered. 



Process 


tsig 


b 




Ndata. 


^95% 


X B 


WW ^ eii$T 


0.14 


19.0 ± 


4.0 


23 


1.1 


pb 


ZZ ee2j 


0.12 


19.7 ± 


4.1 


19 


0.8 


pb 


tt e$T 4j 


0.13 


3.1 ± 


0.9 


8 


0.8 


pb 


tt efi^T 2j 


0.14 


0.6 ± 


0.2 


2 


0.4 


pb 


hi75 WW -* e$T 2j 


0.02 


29.6 ± 


6.5 


32 


11.0 


pb 


/i200 WW e$T 2j 


0.07 


66.0 ± 13.8 


69 


4.4 


pb 


h225 WW e^T 2j 


0.06 


43.1 ± 


9.2 


44 


3.6 


pb 


h2oo ZZ —> ee 2j 


0.15 


17.9 ± 


3.7 


15 


0.6 


pb 


h225 ZZ ^ ee2j 


0.15 


18.8 ± 


3.8 


12 


0.4 


pb 


h250 ^ ZZ ^ ee2j 


0.17 


18.1 ± 


3.7 


18 


0.6 


pb 


Wioo -^WZ^ e$T 2j 


0.05 


27.7 ± 


6.3 


29 


3.4 


pb 


W350 ^WZ e$T 2j 


0.23 


22.7 ± 


5.2 


27 


0.7 


pb 


Wl,oo^WZ-*elpT2j 


0.26 


2.1 ± 


0.8 


2 


0.2 


pb 


Z'y^o tt ^ e$T 


0.11 


18.7 ± 


4.0 


20 


1.1 


pb 


Z'^so tt ^ e^T 4.j 


0.14 


18.7 ± 


4.0 


20 


0.9 


pb 


Z'sso tt ef^T 


0.14 


3.8 ± 


1.0 


2 


0.3 


pb 


Whiis e$T 2j 


0.08 


37.3 ± 


8.2 


32 


2.0 


pb 


Zhiis ~* ee2j 


0.20 


19.5 ± 


4.1 


25 


0.8 


pb 


LQ225LQ225 ee2j 


0.33 


0.3 ± 


0.1 





0.07 pb 



TABLE III. Limits on cross section x branching fraction 
for the processes discussed in the text. All final states are 
inclusive in the number of additional jets. The fraction of the 
signal sample satisfying QUAERO's selection criteria is denoted 
Esig; b is the number of expected background events satisfying 
these criteria; and A''data is the number of events in the data 
satisfying these criteria. The subscripts on h, W' , Z' , and 
LQ denote assumed masses, in units of GeV. 

We note slight indications of excess in the searches for 
tt efJr 4j and tt e^Ipr 2j (corresponding to cross 



section x branching fractions oi a x B = 0.39 



+0.21 

'0.19 



pb 



and 0.14lQ j!)g pb) that are consistent with our measured 
ti production cross section of 5.5 ± 1.8 pb Q and known 
W boson branching fractions. Observing no compelling 
excess in any of these processes, limits ox\ a x B are de- 
termined at the 95% CL. As expected, we find these data 
insensitive to standard model ZZ production (with pre- 
dicted a X B ~ 0.05 pb), and to associated Higgs boson 
production (with predicted a x ;B < 0.01 pb). As a check 
of the method, QUAERO almost exactly duplicates a pre- 
vious search for LQLQ — > ee 2j [|l^ . 

QUAERO is a method both for automatically optimiz- 
ing searches for new physics and for allowing D0 to make 
a subset of its data available for general use. In this 
Letter we have outlined the algorithm used in QUAERO, 
and we have described the final states currently available 
for analysis using this method. QUAERO's performance 
on several examples, including both standard model and 
resonant WW, ZZ, and tt production, has been demon- 
strated. The limits obtained are comparable to those 
from previous searches at hadron colliders, and the search 
for W' WZ is the first of its kind. This tool should in- 
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Background density Signal density Selected region Background density Signal density Selected region 
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FIG. 1. The background density (a), signal density (b), 
and selected region (shaded) (c) determined by QUAERO for 
the standard model processes discussed in the text. From 
top to bottom the signals are: WW e^If^r, ZZ ee2j, 
tt ^ elpT 4j, and tt — > e^lpT 2j- The dots in the plots in the 
rightmost column represent events observed in the data. 
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100 
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300 200 „ 
St 

FIG. 2. QUAERO's analysis of signatures involving undis- 
covered particles. From top to bottom the hypothetical sig- 
nals are: /1200 — > ZZ ee2j, Z550 tt ^ elpT^j, 
Whii^j — » efJr^j, and LQ225LQ225 ee2j. Plots (c) of the 
first two rows show the discriminant D (curve), the threshold 
Dcut (horizontal line), and the data (histogram); the region 
with D > Dcut is selected. 



crease the facility with which new models may be tested 
in the future. 
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